Less Is More: Pay Less Attention in Vision Transformers

Authors

Abstract

Transformers have become one of the dominant architectures in deep learning, particularly as a powerful alternative to convolutional neural networks (CNNs) in computer vision. However, Transformer training and inference in previous works can be prohibitively expensive due to the quadratic complexity of self-attention over a long sequence of representations, especially for high-resolution dense prediction tasks. To this end, we present a novel Less attention vIsion Transformer (LIT), building upon the fact that the early self-attention layers in Transformers still focus on local patterns and bring minor benefits in recent hierarchical vision Transformers. Specifically, we propose a hierarchical Transformer where we use pure multi-layer perceptrons (MLPs) to encode rich local patterns in the early stages while applying self-attention modules to capture longer dependencies in deeper layers. Moreover, we further propose a learned deformable token merging module to adaptively fuse informative patches in a non-uniform manner. The proposed LIT achieves promising performance on image recognition tasks, including image classification, object detection and instance segmentation, serving as a strong backbone for many vision tasks. Code is available at https://github.com/zip-group/LIT.
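The cost argument in the abstract can be illustrated with a rough operation count. The sketch below is not taken from the LIT code base: the function names, the 224x224 input, the patch size of 4, and the per-stage channel widths are all assumptions chosen for illustration. It compares approximate multiply-accumulate (MAC) counts for one self-attention layer and one MLP block at each stage of a generic four-stage hierarchical Transformer.

```python
def attention_macs(num_tokens: int, dim: int) -> int:
    """Approximate MACs for one self-attention layer (heads ignored).

    Q/K/V/output projections cost 4 * N * d^2; forming the N x N attention
    matrix and applying it to the values costs 2 * N^2 * d.
    """
    projections = 4 * num_tokens * dim * dim
    attention = 2 * num_tokens * num_tokens * dim
    return projections + attention


def mlp_macs(num_tokens: int, dim: int, expansion: int = 4) -> int:
    """Approximate MACs for one two-layer MLP block with a hidden expansion."""
    return 2 * num_tokens * dim * (expansion * dim)


def stage_costs(image_size: int, patch_size: int, dims: tuple) -> list:
    """Return (tokens, attention MACs, MLP MACs) for each stage.

    Assumes (hypothetically) that each stage halves the spatial side length,
    so the token count drops by 4x from one stage to the next.
    """
    side = image_size // patch_size
    rows = []
    for dim in dims:
        tokens = side * side
        rows.append((tokens, attention_macs(tokens, dim), mlp_macs(tokens, dim)))
        side //= 2
    return rows


if __name__ == "__main__":
    # Hypothetical configuration, not the published LIT settings.
    for stage, (tokens, attn, mlp) in enumerate(
        stage_costs(image_size=224, patch_size=4, dims=(64, 128, 256, 512)), start=1
    ):
        print(f"stage {stage}: tokens={tokens:5d} attention={attn:>13,d} mlp={mlp:>13,d}")
```

Under these assumed settings the first stage holds 3136 tokens, where the quadratic attention term dominates the MLP cost, while the last stage holds only 49 tokens, where attention is cheaper than the MLP. This matches the abstract's motivation for using MLPs in early stages and keeping self-attention for deeper ones.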

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2022

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v36i2.20099